On the top eigenvalue of heavy-tailed random matrices 
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We study the statistics of the largest eigenvalue A max of N x N random matrices with iid entries 
of variance 1/N, but with power-law tails P(My) ~ \Mij\~ 1 " 11 . When /i > 4, A max converges to 2 
with Tracy- Widom fluctuations of order iV~ 2// ' i , but with large finite N corrections. When fi < 4, 
A m ax is of order N 2 ^" 1 ^ 2 and is governed by Frechet statistics. The marginal case p = 4 provides 
a new class of limiting distribution that we compute explicitly. We extend these results to sample 
covariance matrices, and show that extreme events may cause the largest eigenvalue to significantly 
exceed the Marcenko-Pastur edge. 
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One of the most exciting recent result in mathematical 
physics is the Tracy- Widom distribution of the top eigen- 
value of large random matrices [l| . In itself, this result 
is remarkable since it constitutes one of the rare exactly 
soluble case in extreme value statistics for strongly cor- 
related random variables (the eigenvalues of a random 
matrix) , generalizing in a non trivial way the well known 
Gumbel-Fisher-Tippett, Weibull and Frechet cases 0- 
But the truly amazing circumstance is that the very same 
distribution appears in a host of physically important 
problems crystal shapes, exclusion processes Q, se- 
quence matching, directed polymers in random media, 
etc. The last case can in fact be considered, thanks to 
the mapping onto the Tracy- Widom problem, as an ex- 
actly soluble disordered system in finite dimensions, for 
which not only the scaling exponents but the full dis- 
tribution of the ground state energy can be completely 
characterized [5j. 

As for many limit theorems, the Tracy- Widom result 
is in fact expected to hold for a broad class of random 
matrices. The precise characterisation of this class, as 
well as the extension of the Tracy- Widom result for other 
classes, is a subject of intense activity 0, 0- It is al- 
ready known that for symmetric A x A matrices M with 
IID entries My of variance 1/N, such that all moments 
are finite, the Tracy- Widom result holds asymptotically 
. The case where the distribution of entries decays as 



a power-law 



Mi. 



(possibly multiplying a slow 



function) is expected to fall in a different universality 
class, at least when p is small enough. In the case p < 2 
where the variance of entries diverge, it is known that 
even the eigenvalue spectrum p(X) of M is no longer 
the Wigner semi-circle but itself acquires power-law tail 
p(X) ~ JAI~ 1— M , bequeathed from the tails of the matrix 
entries Correspondingly, the largest eigenvalues are 

described by Frechet statistics ^3 • What happens when 
p is in the range ]2, +00), such that the eigenvalue spec- 
trum p(X) still converges @, for large N, to the Wigner 
semi-circle? The aim of this letter is to discuss this prob- 
lem in details. We find that as soon as p > 4, the Tracy- 
Widom result holds asymptotically, albeit with large fi- 



nite size corrections that we compute. For p < 4, the 
largest eigenvalues are still ruled by Frechet statistics. 
The marginal case p = 4 provides a new class of limiting 
distribution that we compute explicitly. We then extend 
these results to the case of sample covariance Wishart 
matrices, for which power-law tailed elements are ex- 
tremely common, for example in financial applications 
[i"l| . Finally, the relation with directed polymers in the 
presence of power-law disorder is shortly addressed. 

We start by considering real symmetric matrices with 
IID elements of variance equal to 1/N, and such that the 
distribution has a tail decaying as: 
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where the tail amplitude insures that My's are of order 
AN" 1 / 2 . As soon as p > 2, the density of eigenval- 
ues converges to the Wigner semi-circle on the interval 
A G [—2,2], meaning that the probability to find an eigen- 
value beyond 2 goes to zero when TV — ► 00. However, 
this does not necessarily mean that the largest eigen- 
value tends to 2 - we will see below that this is only 
true when p > 4. In order to understand the statistics 
of the largest eigenvalues, we need first to study the fol- 
lowing auxiliary problem. Consider an N x N random 
matrix M with iid elements My ~ A -1 / 2 , such that its 
eigenvalue spectrum is, for large A, the Wigner semi- 
circle. Now, we perturb this matrix by adding a certain 
amount S to a given pair of matrix elements, say (a, (3): 
Map — > Mq/3 + S and Mp a — > Mp a + S. What can one 
say about the spectrum of this new matrix? There are 
several ways to solve this problem: self-consistent per- 
turbation theory (that we use below), free convolution 
methods or the replica method; the last two meth- 
ods in principle require some specific properties of matrix 
M, for example that M has Gaussian entries. However, 
the three methods give the same results for large A, as 
can be understood from general diagrammatic consider- 
ations (see e.g. ^3)- Self-consistent perturbation theory 
is rather straightforward and can be easily extended to 
other cases, such as Wishart matrices (see below). We 
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write down the eigenvalue equations as: 

AI hj Vj + M ii0l v a + M lt 0vp = Xvi] i ^ a,{3 (2) 



while for i = a, neglecting M a ^ compared to 5: 
^ M a jVj + Svp + M a , a v a = Xv ai 



(3) 



and similarly for i — p. We look for a special solution 
such that v a = vp = v* is of order unity, whereas all 
other vSs are of order iV -1 ' 2 . We assume (as will be 
self-consistently checked) that in the large -/V limit the 

terms J2i^ a ,/3 M a,iVi and J2j^ a ,/3 M Pd v 3 botn converge 
to Kv* , where K is a constant to be determined. As 
a consequence, from Eq. ©, A = 5 + K up to small 
corrections. One can now solve equation (J3J) to obtain: 



N-2 

E 



i 



S + K — rit 



wfwjiM^ + Mj^v*, (4) 



where rj£ and w\ are the eigenvalues and the eigenvectors 
of the N — 2 x N — 2 matrix obtained from M removing 
the rows and the columns a and (3. Using this expression, 
we can compute J2i^ a a M a ,iVi- Up to terms negligible 
in the large N limit one finds: 



M a<i Vi w Jdrjp w (r])- 



(5) 



where pw(v) IS > by assumption, the Wigner semicircle. 
Performing the integral over 77, the above self-consistency 
assumption finally leads to K+StV( k + S ) 2 ~ 4 = 2K - 
This equation for K only has a solution when |5| > 1, 
in which case K = 1/S and the corresponding eigenvalue 
of the perturbed matrix is A = 5+1/5 with |A| > 2, 
which is therefore expelled from the Wigner sea (see 
for a similar mechanism in the case of sample covariancc 
matrices, and flij for Hermitian random matrices). Note 
that these eigenvalues come in pairs, with A = —5—1/5, 
corresponding to v a = — vp = v* . When |5| < 1, one 
the other hand, no such eigenvalue exist, our assumption 
that there exists a localised eigenvector sensitive to the 
presence of 5 breaks down, and the edge of the spectrum 
remains A max = 2 in this case. One can in fact com- 
pute v* and characterize completely the corresponding 
localised eigenvector. Using Eq. 10} and imposing the 



normalisation condition 2v 



> 2 



E 



1 + Jdripwiv) 



(S+K- V y 



1 one finds: 



(6) 

showing that for large 5, as expected, the eigenvector 
completely localises on a and /3, whereas when |5| — > 1, 
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FIG. 1: Histogram of A max conditioned on \S\ < 0.95 for 
p — 5 for N — 200, 500, 1000 each eigenvalue has been shifted 
by the empirical mean and scaled by N 2 ^ 3 , for comparison 
a GOE Tracy- Widom distribution of zero mean and variance 
adjusted to match TV = 500 data is also shown (data obtained 
from [IrI'I. Similar agreement with Tracy- Widom and scaling 
in jV 2//a is obtained for any value of /i when conditioned on 
\S\ < 0.95. Note that for the parameters chosen here, the 
probability of \S\ > 1 is still quite large (75.2%) at N = 
1000. Even for such large values of N, the unconditional 
distribution of A max has a marked power-law tail of index p 
(dotted line) and is very different from the asymptotic Tracy- 
Widom distribution (inset). 



it "dissolves" over all sites - for |5| < 1 the perturbation 
is not strong enough to induce condensation. Eq. © 
enables one to compute various participation ratios of the 
eigenvectors, for example W4 = JT v f = (1 — 5~ 2 ) 2 /2. 

The above computation shows that adding any entry 
strictly less than unity (in absolute value) to a Wigner 
matrix does not affect the statistics of its largest eigen- 
value. There is in fact a stronger theorem, due to S. 
Peche , showing that the addition of any matrix of 
rank < eN, e — ► 0, with its largest eigenvalue A less 
that unity, leaves unchanged the statistics of the largest 
eigenvalue of a random Hermitian matrix. The mecha- 
nism leading to such a result is very similar to the one 
above; the largest eigenvalue of the resulting matrix is 
A + A" 1 when A > 1, and 2 otherwise. 

We can now come back to our initial problem, and de- 
fine the matrix M by removing all elements of M that 
are (in absolute value) larger than CN~ 1 ^ 2 , where C is 
finite, but as large as we wish, ft is clear that all the mo- 
ments of M are now finite; therefore the largest eigen- 
value of M has a Tracy- Widom distribution of width 
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TV" 2 / 3 around A max = 2 0, @. The number of 'large' 
entries that we have removed is, using Eq. (TJ, TV 2 e with 
e = (A/C)^ <C 1. Now, we should add back the en- 
tries that we have left out, starting by all those between 
CN~ X / 2 and 1~. Naively, each one of them leaves A max 
unchanged: one can dress M with all entries less that 1 
and still keep the largest eigenvalue Tracy- Widom. If the 
number of such elements was TVe (with e — ► 0), the Peche 
theorem would insure that this is true. Unfortunately, 
this number is rather TV 2 e, but all added entries are IID 
and randomly scattered over the matrix, and the Wigner 
semi-circle is preserved at each step. It is thus natural to 
conjecture that provided all these entries are strictly be- 
low unity, the largest eigenvalue remains Tracy- Widom. 
We have checked this numerically for different values of 
(j, (see Fig. 1). 
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We are now left with entries |My| > 1. From Eq. 
O, their number is N 2 P{M ij )dM ij = A^N 2 ~^ 2 . 
In the case \i > 4, it is clear that this number tends to 
zero when TV — * oo. With probability close to unity for 
large TV, no entry is larger than one, in which case the 
largest eigenvalue is Tracy- Widom. With small probabil- 
ity, the largest element S of M exceeds one; its distribu- 
tion is A^N" 2- ^ 2 /\S\ 1+t " and the corresponding largest 
eigenvalue, using the above analysis, is A max = S + 1/ S. 
For fj, > 4 and large but finite TV, we therefore expect 
that the distribution of the largest eigenvalue of M is 
Tracy- Widom, but with a power-law tail of index /j, that 
very slowly disappears when TV — > oo. Our numerical 
results are in full agreement with this expectation (see 
Fig. 1). When fj, < 4, on the other hand, the num- 
ber of large entries increases with TV. However, when /x 
is larger than 2, such as to insure that the eigenvalue 
spectrum still converges to the Wigner semi-circle, the 
number of row or columns where two such large entries 
appear still tends to zero, as TV 2_/i . Therefore, the above 
analysis still holds: for each large element Sij exceeding 
unity, one eigenvalue A = Sij + S^ will pop out of the 
Wigner sea. Even if the eigenvalue density tends to zero 
outside of the interval [—2, 2] when 2 < fi < 4, the num- 
ber of eigenvalues exceeding 2 (in absolute value) grows 
as TV 2 ~ M / 2 <C TV. The k largest entries are well known 
to be given by a Poisson point process with Frechet in- 
tensity 0b the order of magnitude of the kih largest 
entry is AN 2 /**- 1 / 2 /k 1 ^ which diverges with TV, such 
that in this regime the eigenvectors become strictly lo- 
calized (v* = ±l/\/2). The largest eigenvalues are then 
equal to the largest entries and are themselves given by 
a Poisson point process with Frechet intensity, as proven 
by Soshnikov in the case \i < 2 [lj} . His result therefore 
holds in the whole range // < 4. Finally, the marginal 
case /x = 4 is easy to understand from the above discus- 
sion. The number of entries exceeding one remains of 
order unity as TV — > oo; the distribution of the largest 



FIG. 2: Histogram of A max for /i = 4 for matrices of size 1000 
and 2000. The solid line shows the transformed Frechet dis- 
tribution with the tail amplitude A set to the value used in 
simulation. Note that at A = 2, there should be a Dirac delta 
which is impossible to distinguish at this scale from the inte- 
grable singularity at A = 2 + . Top inset: Scatter plot of largest 
eigenvalue (A max ) vs largest absolute element (\S\). Theory 
predicts A max = 2 with TV~ 1//3 fluctuations for \S\ < 1 and 
A max = \S\ + 1/\S\ with Tv~ 1/2 fluctuation for |5| > 1. Bot- 
tom inset: Scatter plot of the inverse participation ratio (W4) 
of the top eigenvector vs \S\ compared to the prediction from 
Eq. © (bottom inset). Similar scatter plots were obtained 
for other values of a. 



entry S is Frechet with TV-independent parameters: 



P^(\S\) 



4A 4 



■ exp 



A 4 
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(7) 
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The probability that \S\ exceeds 1 is ip = 1 — e 
in which case A max = \S\ + I5*! -1 ; otherwise, with prob- 
ability 1 — ip, A max = 2. This characterizes entirely the 
asymptotic distribution of the largest eigenvalue in the 
marginal case \i = 4: it is a mixture of a <5-peak at 2 and a 
transformed Frechet distribution. Note that this asymp- 
totic distribution is non-universal since it depends explic- 
itly on the tail amplitude A. Our argument also predicts 
the structure of the eigenvector when \S\ is finite, see Eq. 
JHJ. Again, all these results are convincingly borne out 
by numerical simulations, sec Fig. 2. The statistics of 
the second, third, etc. eigenvalues could be understood 
along the same lines. 

We now turn to the case of sample covariance matri- 
ces, important in many different contexts. The 'bench- 
mark' spectrum of sample covariance matrix for iid Gaus- 
sian random variables is well known, and given by the 
Marcenko-Pastur distribution 0|. Here again, the spec- 
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trum has a well defined upper edge, and the distribution 
of the largest eigenvalue is Tracy- Widom (see e.g. Q). 
What happens if the random variables have heavy tails? 
More precisely, we consider N times series of length T 
each, denoted x\, where i = l,..,N and t = 1,...,T. 
The x\ have zero mean and unit variance, but may have 
power-law tails with exponent \i. For example, daily 
stock returns are believed to have heavy tails with an 
exponent /i in the range 3 — 5 . The empirical covari- 
ance matrix C is defined as: 



_ \ " * * 



(8) 



When the time series arc independent, and for T and N 
both diverging with a fixed ratio Q = T/N, the eigenval- 
ues of C are distributed in the interval [(1 — Q" 1 / 2 ) 2 , (1 + 
Q~V 2 ) 2 ]. when T — > oo at fixed N, all eigenvalues 
tend to unity, as they should since the empirical covari- 
ance matrix converges to its theoretical value, the iden- 
tity matrix. When N and T are large but finite, the 
largest eigenvalue of C is, for Gaussian returns, a dis- 
tance ~ TV -2 / 3 away from the Marcenko-Pastur edge, 
with Tracy- Widom fluctuations. When returns are ac- 
cidentally large, this may cause spurious apparent cor- 
relations and substantial overestimation of the largest 
eigenvalue of C. Let us be more specific and assume, 
as above, that one particular return, say x T a , is excep- 
tional large, equal to S. A generalisation of the above 
self-consistent perturbation theory, or free convolution 
methods, shows that whenever S < (NT) 1 / 4 , the largest 
eigenvalue remains stuck at A max = (1+Q^ 1 ' 2 ) 2 , whereas 
when S > (NT) 1 / 4 , the largest eigenvalue becomes: 
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(9) 



This result again enables us to understand the statistics 
of A max as a function of the tail exponent /1. For N 
times series of iid random variables, of length T each, 
the largest element is of order (NT) 1 ^. For [i > 4, this 
is much smaller than (NT) 1 / 4 and, exactly as above, we 
expect the largest eigenvalue of C to be Tracy- Widom, 
with possibly large finite size corrections 01- For /i < 4, 
large 'spikes' in the time series dominate the top eigen- 
values, which arc of order A max ~ n*/h-1q2/»-i an d 
distributed according to a Frcchet distribution of index 
/Lt/2. For applications to financial data, reasonable num- 
bers for intraday data are (j, = 3, N = 500 and Q = 2, 
leading to A max « 8, compared to the Marcenko-Pastur 
edge located at 2.914. This shows that the effect can 
indeed lead to anomalously large eigenvalues with no in- 
formation content. In the marginal case fj. = 4, as above, 
Amax has a finite probability to be equal to the Marcenko- 
Pastur value, and with the complementary probability it 
is distributed according to a transformed Frcchet distri- 
bution of index 2, with a T and N independent scale. 



The structure of the corresponding eigenvectors can also 
be investigated and is again found to be partly localized 
when S > (NT) 1 / 4 . Finally, we expect similar results 
to hold for the Random Singular Value problem stud- 
ied in 18], where rectangular matrices corresponding to 
cross correlations between different sets of time series are 
considered. 

As mentioned in the introduction, the Tracy- Widom 
distribution for the largest eigenvalue of complex sample 
covariance matrices has deep links with the directed poly- 
mer problem in (1+1) dimension Q. A naive guess would 
therefore be that the universality class of the ground state 
energy changes whenever the disorder of the directed 
polymer problem has fat tails with an exponent [i < 4. 
This is in fact not correct, at least in the version of the 
directed polymer problem where each site carries a ran- 
dom iid energy. In this case, simple Flory type arguments 
[lflj suggest that the universality class in fact changes as 
soon as ^ < 5. More precisely, the energy fluctuations 
should scale as N 1 / 3 and by Tracy- Widom for fj, > 5, 
and as N 3 /^' 1 ) for 2 < /i < 5 with a new type of lim- 
iting distribution (the case fj, < 2 corresponds to a com- 
plete stretching of the polymer and was recently solved 
in |2fij). We have conducted new numerical simulations 
of this problem which indeed confirm that for \i > 5, the 
ground state energy scales as N 1 / 3 with Tracy- Widom 
fluctuations, while for fi < 5 the above Flory prediction 
seems correct. The distribution P of ground state energy 
can be fitted by a geometric convolution of Frechet distri- 
butions: P= (l-p)(F+pF-kF+p 2 F-kF-kF+...), different 
from the pure Frechet distributed reported above for the 
largest eigenvalue for /1 < 4. Following 0, the correct 
mapping should in fact be onto a directed polymer with 
power-law columnar disorder. We leave this for further 
detailed investigations. 

In summary, we have analyzed the statistics of the 
largest eigenvalue of heavy tailed random matrices. We 
have shown that as soon as the entries have finite fourth- 
moment, the largest eigenvalue has Tracy- Widom fluc- 
tuations, whereas if the fourth-moment is infinite, the 
largest eigenvalue diverges with the size of the matrix 
and has Frechet fluctuations. In the marginal case where 
the fourth-moment only diverges logarithmically, the dis- 
tribution is a non-universal mixture of a delta peak and 
a modified Frechet law. The structure of the associated 
eigenvector evolves from being completely delocalized in 
the Tracy- Widom case, to partially or totally localized 
in the Frechet case. We have shown that similar results 
holds for sample covariance matrices, and that extreme 
events may cause the largest empirical eigenvalue to sig- 
nificantly exceed the Marcenko-Pastur edge. 
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